Motivating Non - Negative Matrix Factorizations ∗
نویسنده
چکیده
Given a vector space model encoding of a large data set, a usual starting point for data analysis is rank reduction [1]. However, standard rank reduction techniques such as the QR, Singular Value (SVD), and Semi-Discrete (SDD) decompositions and Principal Component Analysis (PCA) produce low rank bases which do not respect the non-negativity or structure of the original data. Non-negative Matrix Factorization (NMF) was suggested in 1997 by Lee and Seung (see [6] and [7]) to overcome this weakness without significantly increasing the error of the associated approximation. NMF has been typically applied to image and text data (see for example: house and facial images in [6], handwriting samples in [9]), but has also been used to deconstruct music tones [4]. The additive property resulting from the non-negativity constraints of NMF has been shown to result in bases that represent local components of the original data (i.e.doors for houses, eyes for faces, curves of letters and notes in a chord). In this paper, we intend to motivate the application of NMF techniques (with noted corrections) to other types of data describing physical phenomena. The contents of this paper are as follows. Section 2 details the NMF objective functions and update strategies used here and in practice. In Section 3 we illustrate both the error and resulting basis for text and image collections. We then turn to the specific example of NMF performed on remote sensing data in Section 4. We emphasize recently proposed NMF alterations and compare the output obtained with the remote sensing literature. We conclude in Section 5 by describing future work for this promising area.
منابع مشابه
Nonnegative Matrix Factorizations Performing Object Detection and Localization
We study the problem of detecting and localizing objects in still, gray-scale images making use of the part-based representation provided by non-negative matrix factorizations. Non-negative matrix factorization represents an emerging example of subspace methods which is able to extract interpretable parts from a set of template image objects and then to additively use them for describing indivi...
متن کاملSurvey on Probabilistic Models of Low-Rank Matrix Factorizations
Low-rank matrix factorizations such as Principal Component Analysis (PCA), Singular Value Decomposition (SVD) and Non-negative Matrix Factorization (NMF) are a large class of methods for pursuing the low-rank approximation of a given data matrix. The conventional factorization models are based on the assumption that the data matrices are contaminated stochastically by some type of noise. Thus t...
متن کاملRiordan group approaches in matrix factorizations
In this paper, we consider an arbitrary binary polynomial sequence {A_n} and then give a lower triangular matrix representation of this sequence. As main result, we obtain a factorization of the innite generalized Pascal matrix in terms of this new matrix, using a Riordan group approach. Further some interesting results and applications are derived.
متن کاملSparse non-negative matrix factorizations via alternating non-negativity-constrained least squares for microarray data analysis
MOTIVATION Many practical pattern recognition problems require non-negativity constraints. For example, pixels in digital images and chemical concentrations in bioinformatics are non-negative. Sparse non-negative matrix factorizations (NMFs) are useful when the degree of sparseness in the non-negative basis matrix or the non-negative coefficient matrix in an NMF needs to be controlled in approx...
متن کاملLearning with matrix factorizations
Matrices that can be factored into a product of two simpler matrices can serve as a useful and often natural model in the analysis of tabulated or highdimensional data. Models based on matrix factorization (Factor Analysis, PCA) have been extensively used in statistical analysis and machine learning for over a century, with many new formulations and models suggested in recent years (Latent Sema...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003